Low-bit-rate Speech Coding
نویسنده
چکیده
Low-bit-rate speech coding, at rates below 4 kb/s, is needed for both communication and voice storage applications. At such low rates, full encoding of the speech waveform is not possible; therefore, low-rate coders rely instead on parametric models to represent only the most perceptually-relevant aspects of speech. While there are a number of different approaches for this modeling, all can be related to the basic linear model of speech production, where an excitation signal drives a vocal tract filter. The basic properties of the speech signal and of human speech perception can explain the principles of parametric speech coding as applied in early vocoders. Current speech modeling approaches, such as mixed excitation linear prediction, sinusoidal coding, and waveform interpolation, use more sophisticated versions of these same concepts. Modern techniques for encoding the model parameters, in particular using the theory of vector quantization, allow the encoding of the model information with very few bits per speech frame. Successful standardization of low-rate coders has enabled their widespread use for both military and satellite communications, at rates from 4 kb/s all the way down to 600 b/s. However, the goal of tollquality low-rate coding continues to provide a research challenge. This work was sponsored by the Defense Advanced Research Projects Agency under Air Force Contract FA8721-05-C-0002. Opinions, interpretations, conclusions, and recommendations are those of the authors and are not necessarily endorsed by the United States Government.
منابع مشابه
A study on the recognition of low bit-rate encoded speech
Digital speech communications are the future trend in the Internet and mobile phones. The low bit-rate coding of speech signals is the essential requirement in the concern of channel bandwidth and transmission efficiency. The voice-based services will become more attractive to the service providers. Many voice-driven applications require that users must be authorized and able to be identified. ...
متن کاملLow Rate Speech Coding Using Contour Quantization
Vector quantization-based approaches to speech coding have generated new interest in very low bit rate speech coding, that is, speech coded to bit rates below 1200 bits/sec. To achieve such low bit rates, it is necessary to quantize the pitch and energy parameters at rates below 100 bits/sec. Contour quantization is introduced as a technique in which the contour of a given parameter is normaliz...
متن کاملLow bit rate wideband WI speech coding
This paper investigates Waveform Interpolation (WI) applied low bit rate wideband speech coding. An analysis of the evolutionary behaviour of wideband Characteristic Waveforms (CWs) shows that direct application of the classical WI algorithm may not be appropriate for wideband speech. We propose a modification whereby CW quantisation is performed using classical WI decomposition for the low fre...
متن کاملSpeech Compression Using Linear Predictive Coding
The aim of the project is to develop a system for encoding good quality speech at a low bit rate. To implement this we have used most powerful speech analysis technique called Linear Predictive Coding (LPC). It uses 10 order Levinson-Durbin Recursion algorithm to accomplish the task. It provides extremely accurate estimates of speech parameters, and is relatively efficient for computation.The s...
متن کاملLow Bit Rate Speech Coding via TCVRQ
We present a new Trellis Coded Vector Residual Quantizer (TCVRQ) that combines trellis coding and vector residual quantization. We introduce new methods for computing quantization levels and experimentally analyze the performances of our TCVRQ in the case of speech coding at very low bit rates. The results obtained show that transparent quantization of Linear Prediction (LP) parameters can be p...
متن کاملSpeech Compression of Thai Dialects with Low-Bit-Rate Speech Coders
Problem statement: In modern speech communication at low bit rate, speech coding deteriorates the characteristics of the coded speech significantly. Considering the dialects in Thai, the coding quality of four main dialects spoken by Thai people residing in four core region including central, north, northeast and south regions has not been studied. Approach: This study presents a comparative st...
متن کامل